NSF PAR Search | NSF Public Access Repository

Parallel median consensus clustering in complex networks

https://doi.org/10.1038/s41598-025-87479-6

Hussain, Md Taufique; Halappanavar, Mahantesh; Chatterjee, Samrat; Radicchi, Filippo; Fortunato, Santo; Azad, Ariful (December 2025, Scientific Reports)

We develop an algorithm that finds the consensus among many different clustering solutions of a graph. We formulate the problem as a median set partitioning problem and propose a greedy optimization technique. Unlike other approaches that find median set partitions, our algorithm takes graph structure into account and finds a comparable quality solution much faster than the other approaches. For graphs with known communities, our consensus partition captures the actual community structure more accurately than alternative approaches. To make it applicable to large graphs, we remove sequential dependencies from our algorithm and design a parallel algorithm. Our parallel algorithm achieves 35x speedup when utilizing 64 processing cores for large real-world graphs representing mass cytometry data from single-cell
Full Text Available
SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix Operations

Anik, Md_Saidul_Hoque; Azad, Ariful (May 2025, Eighth Conference on Machine Learning and Systems (MLSys))

Knowledge graph (KG) learning offers a powerful framework for generating new knowledge and making inferences. Training KG embeddings can take a long time, especially for larger datasets. Our analysis shows that the gradient computation of embeddings is one of the dominant functions in the translation-based KG embedding training loop. We address this issue by replacing the core embedding computation with SpMM (Sparse-Dense Matrix Multiplication) kernels. This allows us to unify multiple scatter (and gather) operations as a single operation, reducing training time and memory usage. We create a general framework for training KG models using sparse kernels and implement four models, namely TransE, TransR, TransH, and TorusE. Our sparse implementations exhibit up to 5.3x speedup on the CPU and up to 4.2x speedup on the GPU with a significantly lower GPU memory footprint. The speedups are consistent across large and small datasets for a given model. Our proposed sparse approach can be extended to accelerate other translation-based (such as TransC, TransM, etc.) and non-translational (such as DistMult, ComplEx, RotatE, etc.) models as well.
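As a rough illustration of the idea in this abstract, the sketch below computes the TransE residual h + r - t for a batch of triples with a single sparse-times-dense product instead of three separate gathers. It is a toy in pure Python, not the SparseTransX implementation: the function names (`build_incidence`, `spmm`, `transe_residuals`), the dict-based sparse rows, and the choice to stack entity and relation embeddings into one dense matrix are all assumptions made for this example.

```python
def build_incidence(triples, num_entities):
    """Build one sparse row per (head, relation, tail) triple.

    Each row is a {column: value} dict over the stacked embedding matrix:
    +1 at the head, -1 at the tail, +1 at the relation (offset by num_entities).
    """
    rows = []
    for head, rel, tail in triples:
        row = {}
        row[head] = row.get(head, 0.0) + 1.0
        row[tail] = row.get(tail, 0.0) - 1.0  # cancels the head if head == tail
        row[num_entities + rel] = 1.0
        rows.append(row)
    return rows


def spmm(sparse_rows, dense):
    """Multiply a sparse matrix (list of {column: value} rows) by a dense matrix."""
    dim = len(dense[0])
    out = []
    for row in sparse_rows:
        acc = [0.0] * dim
        for col, val in row.items():
            for k in range(dim):
                acc[k] += val * dense[col][k]
        out.append(acc)
    return out


def transe_residuals(triples, entity_emb, relation_emb):
    """Return h + r - t for every triple using a single SpMM call."""
    stacked = entity_emb + relation_emb  # entities first, then relations
    incidence = build_incidence(triples, len(entity_emb))
    return spmm(incidence, stacked)


entity_emb = [[1.0, 0.0], [0.0, 2.0], [3.0, 1.0]]
relation_emb = [[0.5, 0.5]]
triples = [(0, 0, 1), (2, 0, 0)]
residuals = transe_residuals(triples, entity_emb, relation_emb)
```

The norm of each residual row would then serve as the TransE score. Under this formulation, the gradient with respect to the stacked embeddings is the transposed incidence matrix times the upstream gradient, which is again a sparse-dense product.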
Full Text Available
Predicting Interactions in the Weapons of Mass Destruction Knowledge Graphs

https://doi.org/10.1007/978-3-031-82439-5_18

Agrawal, Abhigya; Anik, Md_Saidul Hoque; Azad, Ariful (January 2025, Springer Nature Switzerland)

In this paper, we apply graph machine learning methods to predict unseen interactions within the Weapons of Mass Destruction (WMD) dataset, developed by DARPA and IARPA. This dataset captures complex online activities, including sales, purchases, and forum discussions, with a focus on topics such as weapons, explosives, and other sensitive subjects. We represent the data as a knowledge graph, where nodes correspond to entities and edges denote relationships between them. Among various knowledge graph embedding techniques and graph neural networks, semantic matching models like DistMult demonstrate the ability to accurately predict 84% of relations, particularly due to their strength in capturing the one-to-many relationships common in the WMD data. To streamline the analysis, we implement an automated pipeline that stores the knowledge graph in a Neo4j database, extracts subgraphs using Cypher queries, trains knowledge graph embedding models on these subgraphs, predicts links, and reintegrates high-confidence edges back into the main graph.

Full Text Available
Batch Updates of Distributed Streaming Graphs using Linear Algebra

https://doi.org/10.1109/SCW63240.2024.00089

Hassani, Elaheh; Hussain, Md Taufique; Azad, Ariful (November 2024, IEEE)

We develop a distributed-memory parallel algorithm for performing batch updates on streaming graphs, where vertices and edges are continuously added or removed. Our algorithm leverages distributed sparse matrices as the core data structures, utilizing equivalent sparse matrix operations to execute graph updates. By reducing unnecessary communication among processes and employing shared-memory parallelism, we accelerate updates of distributed graphs. Additionally, we maintain a balanced load in the output matrix by permuting the resultant matrix during the update process. We demonstrate that our streaming update algorithm is at least 25 times faster than alternative linear-algebraic methods and scales linearly up to 4,096 cores (32 nodes) on a Cray EX supercomputer.
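To make the "graph updates as sparse matrix operations" idea concrete, here is a minimal single-process sketch. It assumes the graph is a sparse matrix stored as a `{(row, col): weight}` dict, that a batch of insertions is packed into an update matrix and merged by element-wise addition (with the update value winning on conflicts), and that deletions are applied as a mask. The helper names and these conventions are illustrative assumptions, not the paper's distributed algorithm, which also handles communication and load balancing.

```python
def sparse_from_edges(edges):
    """Pack (u, v, weight) edges into a sparse matrix keyed by (row, col)."""
    return {(u, v): w for u, v, w in edges}


def ewise_add(a, b):
    """Element-wise union of two sparse matrices; entries of b win on overlap."""
    out = dict(a)
    out.update(b)
    return out


def mask_out(a, mask):
    """Drop every entry of a whose (row, col) position appears in mask."""
    return {pos: w for pos, w in a.items() if pos not in mask}


def apply_batch(graph, inserted, deleted):
    """Apply one batch: insertions as a matrix addition, deletions as a mask."""
    update = sparse_from_edges(inserted)
    delete_mask = {(u, v) for u, v in deleted}
    return mask_out(ewise_add(graph, update), delete_mask)


graph = sparse_from_edges([(0, 1, 1.0), (1, 2, 1.0)])
graph = apply_batch(graph, inserted=[(2, 3, 4.0), (0, 1, 2.0)], deleted=[(1, 2)])
```

Treating the whole batch as one matrix means the update cost depends on the batch size and the matrix layout rather than on per-edge bookkeeping, which is the property that lets such updates reuse existing sparse matrix kernels.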
Full Text Available
Industrial energy forecasting using dynamic attention neural networks

https://doi.org/10.1016/j.egyai.2025.100504

Majeske, Nicholas; Vaidya, Shreyas Sunil; Roy, Ryan; Rehman, Abdul; Sohrabpoor, Hamed; Miller, Tyson; Li, Wenhui; Fiddyment, CR; Gumennik, Alexander; Acharya, Raj; et al (May 2025, Energy and AI)

We develop a comprehensive framework for storing, analyzing, forecasting, and visualizing industrial energy systems consisting of multiple devices and sensors. Our framework models complex energy systems as a dynamic knowledge graph, utilizes a novel machine learning (ML) model for energy forecasting, and visualizes continuous predictions through an interactive dashboard. At the core of this framework is A-RNN, a simple yet efficient model that uses dynamic attention mechanisms for automated feature selection. We validate the model using datasets from two manufacturers and one university testbed containing hundreds of sensors. Our results show that A-RNN forecasts energy usage within 5% of observed values. These enhanced predictions are as much as 50% more accurate than those produced by standard RNN models that rely on individual features and devices. Additionally, A-RNN identifies key features that impact forecasting accuracy, providing interpretability for model forecasts. Our analytics platform is computationally and memory efficient, making it suitable for deployment on edge devices and in manufacturing plants.
Full Text Available
Distributed-Memory Parallel Algorithms for Sparse Matrix and Sparse Tall-and-Skinny Matrix Multiplication

https://doi.org/10.1109/SC41406.2024.00052

Ranawaka, Isuru; Hussain, Md Taufique; Block, Charles; Gerogiannis, Gerasimos; Torrellas, Josep; Azad, Ariful (November 2024, IEEE)

We consider a sparse matrix-matrix multiplication (SpGEMM) setting where one matrix is square and the other is tall and skinny. This special variant, TS-SpGEMM, has important applications in multi-source breadth-first search, influence maximization, sparse graph embedding, and algebraic multigrid solvers. Unfortunately, popular distributed algorithms like sparse SUMMA deliver suboptimal performance for TS-SpGEMM. To address this limitation, we develop a novel distributed-memory algorithm tailored for TS-SpGEMM. Our approach employs customized 1D partitioning for all matrices involved and leverages sparsity-aware tiling for efficient data transfers. In addition, it minimizes communication overhead by incorporating both local and remote computations. On average, our TS-SpGEMM algorithm attains 5x performance gains over 2D and 3D SUMMA. Furthermore, we use our algorithm to implement multi-source breadth-first search and sparse graph embedding algorithms and demonstrate their scalability up to 512 nodes (or 65,536 cores) on NERSC Perlmutter.
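The link between TS-SpGEMM and multi-source breadth-first search can be sketched on a single machine as follows. The square sparse matrix is the adjacency structure, and the tall-and-skinny sparse matrix is the frontier, with one column per BFS source; each BFS level is one Boolean product of the two. This is a hedged toy in pure Python: `ts_spgemm_bool`, `multi_source_bfs`, and the dict-of-sets storage are assumptions for illustration and say nothing about the paper's 1D partitioning or tiling.

```python
def ts_spgemm_bool(adj, frontier):
    """Boolean product of the transposed adjacency matrix with the frontier.

    adj maps vertex u to its out-neighbors (the sparse square matrix);
    frontier maps vertex u to the set of source columns active at u
    (the sparse tall-and-skinny matrix).
    """
    product = {}
    for u, cols in frontier.items():
        for v in adj.get(u, ()):
            product.setdefault(v, set()).update(cols)
    return product


def multi_source_bfs(adj, sources):
    """Run one BFS per source at once; returns a {vertex: depth} dict per source."""
    frontier = {}
    for col, src in enumerate(sources):
        frontier.setdefault(src, set()).add(col)
    visited = {v: set(cols) for v, cols in frontier.items()}
    dist = [{src: 0} for src in sources]
    depth = 0
    while frontier:
        depth += 1
        reached = ts_spgemm_bool(adj, frontier)
        frontier = {}
        for v, cols in reached.items():
            # Keep only (vertex, source) pairs that have not been seen before.
            new_cols = cols - visited.get(v, set())
            if new_cols:
                frontier[v] = new_cols
                visited.setdefault(v, set()).update(new_cols)
                for col in new_cols:
                    dist[col][v] = depth
    return dist


adj = {0: [1], 1: [2], 2: [3], 3: []}
dist = multi_source_bfs(adj, sources=[0, 2])
```

Because the frontier has only as many columns as there are sources, it stays far narrower than the adjacency matrix, which is the shape mismatch the abstract refers to when it says general-purpose SpGEMM algorithms are suboptimal here.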
Full Text Available
Distributed-Memory Parallel Algorithms for Sparse Matrix and Sparse Tall-and-Skinny Matrix Multiplication

Ranawaka, Isuru; Hussain, Md Taufique; Block, Charles; Gerogiannis, Gerasimos; Torrellas, Josep; Azad, Ariful (November 2024, International Conference for High Performance Computing, Networking, Storage and Analysis SC)

Full Text Available
Scalable Node Embedding Algorithms Using Distributed Sparse Matrix Operations

https://doi.org/10.1109/IPDPSW63119.2024.00205

Ranawaka, Isuru; Azad, Ariful (May 2024, IEEE)

Full Text Available
GNNShap: Scalable and Accurate GNN Explanation using Shapley Values

https://doi.org/10.1145/3589334.3645599

Akkas, Selahattin; Azad, Ariful (May 2024, ACM)

Graph neural networks (GNNs) are popular machine learning models for graphs with many applications across scientific domains. However, GNNs are considered black box models, and it is challenging to understand how the model makes predictions. Game-theoretic Shapley value approaches are popular explanation methods in other domains but are not well-studied for graphs. Some studies have proposed Shapley value based GNN explanations, yet they have several limitations: they consider limited samples to approximate Shapley values; some mainly focus on small and large coalition sizes; and they are an order of magnitude slower than other explanation methods, making them inapplicable to even moderate-size graphs. In this work, we propose GNNShap, which provides explanations for edges since they provide more natural explanations for graphs and more fine-grained explanations. We overcome the limitations by sampling from all coalition sizes, parallelizing the sampling on GPUs, and speeding up model predictions by batching. GNNShap gives better fidelity scores and faster explanations than baselines on real-world datasets. The code is available at https://github.com/HipGraph/GNNShap.
iSpLib: A Library for Accelerating Graph Neural Networks using Auto-tuned Sparse Operations

https://doi.org/10.1145/3589335.3651528

Hoque_Anik, Md Saidul; Badhe, Pranav; Gampa, Rohit; Azad, Ariful (May 2024, ACM)

Full Text Available
